A generalized approach to complex networks 
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Abstract 

This work describes how the formaUzation of complex network concepts in terms of discrete mathemat- 
ics, especially mathematical morphology, allows a series of generalizations and important results ranging 
from new measurements of the network topology to new network growth models. First, the concepts of 
node degree and clustering coefficient are extended in order to characterize not only specific nodes, but any 
generic subnetwork. Second, the consideration of distance transform and rings are used to further extend 
those concepts in order to obtain a signature, instead of a single scalar measurement, ranging from the 
single node to whole graph scales. The enhanced discriminative potential of such extended measurements 
is illustrated with respect to the identification of correspondence between nodes in two complex networks, 
namely a protein-protein interaction network and a perturbed version of it. The use of other measurements 
derived from mathematical morphology are also suggested as a means to characterize complex networks 
connectivity in a more comprehensive fashion. 

PACS numbers: 02.70.Rr,02.10.Ox,89.75.Hc 



I. INTRODUCTION achieved by using 



powerful tools from theoreti- 



cal physics (e.g. | H 0]), relatively little atten- 
One of the unavoidable consequences of ^ion has been given to the treatment of com- 
the fast pace of developrnents in the new area p^^^ networks in terms of discrete mathemat- 
of complex networks 111 U bl Ij is that, j^s and mathematical morphology, which are 
while many impressive and relevant concepts themselves well-established investigation fields, 
and perspectives have been identified and well- Developed mainly by J. Serra and coUabora- 
developed, and promising results been obtained, j^j-s [5] , the area of mathematical morphology 
some interesting issues have received relatively ^imed, through strict mathematical formal- 
little attention. One particularly important point iz^tion, at representing and analyzing the ge- 
is the fact that, despite the major advances 
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ometrical and topological features of discrete in the complex network literature. Such a con- 
mathematical structures, especially regular lat- cept therefore allows us to speak of the degree 
tices such as those underlying digital images | ^. of subgraphs of special interest, such as cycles. 
Mathematical morphology is strongly founded sets of hubs, or the maximum spanning tree of a 
on the discrete operations of complement, di- given complex network. Second, the considera- 
lation and erosion, which can be composed in tion of a series of subsequent dilations, together 
order to obtain a whole series of new operators with the respectively induced distance transform 
with specific properties. At the same time, pre- and rings, allow the further extension of the de- 
vious developments by L. Vincent and H. Hei- gree and clustering coefficient so that a signa- 
jmans ||7l 18|] have shown how the mathemati- ture, instead of the single scalar traditional mea- 
cal morphology framework can be extended to surements, is obtained which can provide infor- 
graphs, allowing not only the precise mathemat- mation about the network connectivity from the 
ical representation and manipulation of those node to the whole graph scales, 
general structures, but also the immediate ac- The potential of such hierarchical extensions 
cess to the wealthy of existing results from for discriminating the connectivity around each 
mathematical morphology. node (or subgraph) can be readily appreciated 
The present article reports on how the appli- by considering the fact that several nodes in 
cation of discrete mathematics, especially math- a complex network will have the same degree 
ematical morpho 



logy m and distance-oriented and clustering coefficient, but very few nodes 

concepts k III Isl, bears the potential not only will share such values calculated for a series of 

for formalization, but also to obtain a series of subsequent neighborhoods. Such an interesting 

new concepts and results. In particular, by con- Mature of the generalized measurements is il- 

sidering the dilations of subnetworks of a net- lustrated in the present article with respect to 

work r and extendingAe concepts of numbers protein-protein interaction networks. 
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of neighbors J|9|; [lOl, lUJ, 112|] and hierarchical 
node degree |13], we show that the traditional jj BASIC CONCEPTS 
concepts of node degree and clustering coeffi- 
cient 111 |4i] can be generalized in two impor- A network T without multiple edges is a dis- 
tant ways. First, the concept of subnetwork di- crete structure composed of a set of nodes V{T) 
lation paves the way to generalize the degree and a set E{T) of edges {u,v) established be- 
and clustering coefficient to any subnetwork of tween specific pairs of nodes of V(T), so that 
r, and not only their specific nodes as adopted the network T is represented as F = {V,E). 



As we consider undirected networks without 
loops, it follows that {u,v) iv,u) and 

(m, m) ^ E{T). Such a network can be conve- 
niently represented in terms of its respective ad- 
jacency matrix K such that each edge (m, v) is 
represented by making K{u, v) = K{v, u) = 
1, while the absence of edge is indicated by 
zero value. A subnetwork ^ of F is any net- 
work such that V{Cl ^ ^(r) and E{Cl ^ 
{{u,v)\{u,v) e E{T) andu,v G V{C)}. Fig- 
ure Ula) illustrates a network F and one of its 
many subnetworks ^, identified by the wider- 
border nodes and wider edges. Particularly in- 
teresting subnetworks of a network F include 
its hubs, outmost nodes (i.e. nodes with low 
degree), as well as its cycles. Special cases 
of subnetworks of F include the empty net- 
work [y = (lS,E = 0), where stands for the 
empty set, networks containing an isolated node 
uTu = {V = {u e V{T)},E = 0), and the 
own original network F. 

The complement of a subnetwork ,^ of F is 
the subnetwork of F such that V{^{^) = 
{u\ueV{T)andu^V{^)} and = 
{{u,v)\{u,v) e E(T) andu,v e V{Cr)}- Fig- 
ure [Hb) illustrates the complement of ^ in F. 
A subnetwork is connected if any of its nodes 
can be reached from any of its other nodes. Two 
subnetworks ( and ^ of F are connected if it is 
possible to reach a node of ^ from a node of 
and vice- versa. The maximal connected subnet- 
works, in the sense of including the largest num- 



ber of nodes, of a network are called connected 
components. The subnetwork in Figure [Ha) is 
not connected but contains two connected com- 
ponents. 

The degree of a node u of F, hence k(u), cor- 
responds to the number of edges attached to that 
node. The degree of a subnetwork ^ of F, hence 
A;(^), is defined as the number of edges implied 
by the dilation of ^, i.e. those edges connecting 
^ to the rest of F. For instance, the degree of the 
subnetwork ^ in Figure [Tl a) is 12. The outmost 
set of a subnetwork ,^ of F is the set of nodes 
^{^) which have unit degree. For simplicity's 
sake, such nodes are henceforth referred to as 
outnodes. The 1 -neighborhood of a node u of 
F, henceforth represented as ni{u), is the set of 
nodes of F which are attached to u, plus node 
u. This concept can be immediately extended 
to express the neighborhood of a subnetwork ^ 
of F, given as the set of nodes of F which are 
connected to ^ plus the nodes 'mV{^). 

ni. COMPLEX NETWORK MORPHOLOGY 

The dilation of a subnetwork ,^ of F is 
defined as the subnetwork S{^) of F having 
^("^(O) = ^i(0 its set of nodes while its 
set of edges include the edges of F found be- 
tween the nodes in ni{^). The erosion of ^, rep- 
resented as e{^), is a subnetwork of F which 
can be defined as the complement of the di- 
lation of Cf, i.e. e{^) = {S{^r))'r- Observe 



(a) (b) (c) (d) 




(e) (f) (g) (h) 

Figure 1: Original network containing a subnetwork (a), identified by wider-border nodes, and its respective 
complement (b), dilation (c), erosion (d), opening (e), closing (f), and distance transform (g), with the 
distances identified by the node border width. The generalized Voronoi tessellation of the two connected 
components of the subnetwork in (a) is shown in (h). 



that the dilation or erosion of T yields T as re- 
sult. Figures [He) and (d) ilustrates the dila- 
tion and erosion of the subnetwork ^ in (a), re- 
spectively. Observe that the erosion eliminated 
one of the connected components of ^. Gen- 
erally, S{e{^)) 7^ e{6{^)), i.e. the dilation can 
not be used to undo an erosion, and vice-versa. 
Observe also that a subnetwork ^ is necessar- 
ily contained or equal to its respective dilation, 
while the erosion of a subnetwork ^ is necessar- 
ily contained or equal to itself. 

The d-dilation of a subnetwork ^ is defined 
as the subnetwork obtained by dilating d times 
the subnetwork ^, i.e.: 

m = m- • • (0 • • •)) (1) 

^ V ' 

d 

Similarly, the d-erosion can be defined as: 



6,(0 = e{e{. . . (0 . . .)) (2) 

^ V ' 

d 

Observe that 5d{^) converges to F as d is 
increased, while ed{C) converges to the empty 
network under similar circumstances. We also 
have that 5d=i+j{0 = 5^(5^(0) = 
and ed=i+j{0 = ^ii^jiO) = ^ji^iiO)- The 
d-degree of a subnetwork ^ is defined as the 
degree of the J-dilation of the network ^, i.e., 

kdio = Hm))- 

It is possible to use combinations of dilations 
and erosions of a subnetwork ^ in order to ob- 
tain new operators such as the opening and clos- 
ing of ^, which are defined as a(^) = 5{£{^)) 
and u;(,^) = e (5 (^)), respectively. Figures [T] 
(e) and (g) illustrate, respectively, the opening 
and closing of the subnetwork ^ in (a). Ob- 



serve that the closing of ^ had as an effect the 
connection of the two components of that sub- 
network, filling the gap between those subnet- 
works. The opening and closing operations are 
idempotent, in the sense that = a{^) 

and uj{uj{^)) = uj{^). It is also interesting to de- 
fine the d-opening of a subnetwork ^, henceforth 
represented as ad{C), corresponding to d ero- 
sions followed by d dilations. The d-closing of 
^, represented as uJd{C) can be defined in similar 
fashion. The latter operator is useful for investi- 
gating the progressive merging of subnetworks 
of r in terms of increasing values of d. Par- 
ticularly, interesting information about the net- 
work structure can be provided by the evolution 
of the number of connected subnetworks, start- 
ing from a specific set x of subnetworks (e.g. 
the network 3-cycles), in terms of a sequence of 
J-openings (or closings) performed for increas- 
ing values of d. 

IV. DISTANCES, DISTANCE TRANSFORMS, 
PARALLELS AND RINGS 

Several important features of a complex net- 
work are related to the concept of distance. If C, 
and ^ are any subnetworks of F, the (minimal) 
distance between the respective set of nodes 
V{Q and V{i), hence D{y{C),V{i)), can be 
defined as the value of d for which some node u 
of C becomes included into V{5d{C))- It can be 
verified that D(l^(C), 1^(0) = D{V{i),V{Q). 
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Observe that D (1^(0, V"(C)) = 0. In particu- 
lar, the distance between a node u and a sub- 
network ^ is given as D{{u}, V{^)). The dis- 
tance transform of a subset of nodes x of F 
is the mapping which assigns D{{u], x) to ev- 
ery node u E V^(F), including those in ^. Fig- 
ure Wig) illustrate the distance transform of the 
subnetwork in (a), with the distance values (4 
values for this subnetwork, i.e. d = 0,1,2, 
and 3) expressed in terms of the node borders 
widths. Given a subnetwork ^ of F, the subnet- 
work g defined by the set V{p) of nodes such 
that D{V{g),V{^)) = d and the set of those 
edges of F connecting nodes in V{g) is called 
the d-parallel of ^, henceforth represented as 
PdiO- The parallels of the subnetwork ^ in Fig- 
ure [TI a) correspond to the set of nodes with the 
same width in (g) plus the respective intercon- 
necting edges. The number of nodes and edges 
in a rf-parallel of ^ are henceforth represented as 
n{Pd{^)} and e{Pd{0}- Similarly, it is inter- 
esting to define the rs-ring of ^, hence -Rrs(0' 
which corresponds to the union of the respec- 
tive parallels of ^ for distances d = r to s plus 
the edges of F interconnecting such parallels. 
The number of nodes and edges in a rs-ring of 
g are henceforth represented as ^{-Rrs(0} 
e{-Rrs(0}- Observe that a J-parallel therefore 
is the particular case of the r^-ring for c? = r = 
s. Another interesting possibility is to use the 
above introduced distance concepts in order to 
obtain the generalized Voronoi tessellation of 



subnetworks, as illustrated in Figure [Tlh) with 
respect to the two connected components in the 
subnetwork in Figure [TJ a). 

The above definitions allow the concept of 
clustering coefficient Q] to be generalized to 
parallels and rings of any subnetwork. The rs- 
clustering coefficient of a subnetwork ^ of F, 
henceforth represented as cCrs{C)^ can be de- 
fined as the number of edges in the respective 
r5-ring subnetwork, divided by the total of pos- 
sible edges between the nodes in that ring, i.e.: 



2e{i?.,(0} 



n{Rrs{mn{Rrs{0] - 1) 



(3) 



V. HIERARCHICAL MEASUREMENTS FOR 
SINGLE NODES 

The concepts discussed above can be natu- 
rally extended to a single node and to an edge, 
respectively, whether the subgraph contains the 
node alone and whether the subgraph contains 
the edge and both nodes connected by the edge 
ll^llJ]. Using the concept of rings considered 
in the last section, henceforth the subgraph Q 
composed of the ring Rd{u) is defined as the hi- 
erarchical level related to the subgraph ^ com- 
posed of the single node (u), such that, the hier- 
archical number of nodes n(i{u) (or n{Rd{u)}) 
is given as the number of nodes at hierarchi- 
cal distance d from de reference node (m), i.e., 
the number of nodes in the ring Rd{u). Hence, 
the hierarchical degree kd{u) is defined as the 



number of edges between the nodes in the ring 
Rd and Rd+i, such that the hierarchical number 
of edges among the nodes in the ring Rd{u) is 
ed{u) (or e{Rd{u)}). The hierarchical cluster- 
ing coefficient ccd{u) is written using equation 
3. 

2ed{v) 



(4) 



nd{u){nd{u) - 1) 
At last, other hierarchical measurements can be 
derived from the above definitions and each of 
them will be dealt with in turn: 

• Convergence ratio {Cd{u)) : Measures 
the ratio between the hierarchical node 
degree of node (n) at hierarchical dis- 
tance d — I and the hierarchical number 
of nodes in the ring Rd{u). 

kd-i{u) 



Cd{u) 



ridW 



(5) 



Intra-ring degree {Ad{u)) : The average 
among the degrees of the nodes in the ring 

Rdiu). 



Adiu) 



ndiu) 



(6) 



Inter-ring degree {Ediu)) : The average 
of the number of connections between 
each node in ring Rdiu) and those in 
Rd+i{u). 

kd{u) 



Ediu) 



ndiu) 



(7) 



Hierarchical common degree {Hd{u)) : 
The average node degree among the 



nodes in Rdiu), considering all edges 
connected to nodes in the ring. 

^ 2ed{u) + kd^i{u) + kd{u) 



A summary of the hierarchical measure- 
ments considered in the present work are 
presented in the table bellow. 



Hier. number of nodes in the 
ring Rd{u). 


nd(u) 


Hier. number of edges among 
the nodes in the ring Rd{u). 


ed(u) 


Hier. degree of node [u) 
at distance d. 


kd(u) 


Hier. clustering coefficient of 
node (m) at hier. level d. 


CCd(u) 


Convergence ratio of node {u) 
at hier. level d. 


Cd(u) 


Intra-ring node degree of node 
{u) at distance d. 


Ad(u) 


Inter-ring node degree of node 
{u) at distance d. 


Ed(u) 


Hier. common degree of node 
(m) at distance d. 


Hd(u) 



Table 1 - Summary of the hierarchical mea- 
surements considered in this work. 



VI. EVALUATION OF DISCRIMINATIVE 
POWER 

Given a complex network, it is possible to 
organize several selected measurements of its 



topology into a feature vector 7? (e.g. list), 
which therefore provides a quantitative descrip- 
tion of properties of the^etwork. Multivariate 
statistical methods (e.g. [6]) can then be applied 
in order to separate such vectors into clusters or 
to identify the category of the network. 

In a similar fashion, it is possible to assign 
a feature vector to individual nodes of the net- 
work, so that they can be characterized and or- 
ganized into classes. Although simple measure- 
ments such as the node degree and clustering co- 
efficients can be used for this purpose, they are 
generally not enough for a discriminative char- 
acterization of nodes at the individual level be- 
cause several nodes in a large network will have 
identical values of such measurements. The hi- 
erarchical extensions of the node degree and 
clustering coefficient, combined with the ancil- 
lary hierarchical measurements described in this 
work, account for substantially enhanced dis- 
crimination of the local properties of the con- 
nectivity around each node, therefore diminish- 
ing the degeneracy of the description. In other 
words, several nodes may have the same imme- 
diate node degree, but it is rather unlikely that 
they will also share other hierarchical degrees. 
At the same time, nodes which do present sim- 
ilar connectivity patterns along the hierarchies 
can be clustered into meaningful classes by con- 
sidering feature vectors composed of hierarchi- 
cal measurements. 

In order to illustrate the above possibilities, 
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we considered a S. cerev/i'/ae protein-protein in- undergoes the three following regimes: (i in- 
teraction network T uG\ containing = 1922 creases along the 4 or 5 initial hierarchies; (ii) 
nodes and without self-connections and isolated stays nearly constant until about 12 hierarchi- 
nodes. A perturbed version of this network was cal levels, and (iii) then decreases steadily. This 
obtained by rewiring the edges with probability behavior is observed for all graphs in Figure El 
p. Nodes in these two networks are then charac- The average performance increase in (i) is a di- 
terized in terms of several combinations of hi- rect consequence of the fact that more informa- 
erarchical measurements. In order to quantify tion about the network connectivity around each 
the discriminative power of such measurements, node is being taken into account. The perfor- 
we repeatedly selected a node from the origi- mance plateau and decrease taking place after 
nal network and identified among all nodes of 4 or 5 hierarchical levels are considered for the 
the perturbed network the node which leads to measurements reflects a degeneration in the dis- 
the smallest Euclidean distance between the re- criminative power of the measurements caused 
spective feature vectors. In case these two nodes by the fact that most of the nodes have been con- 
are verified to indeed correspond one another sidered at such hierarchical depths. Similar per- 
(recall that the identity of the nodes is guaran- formances are observed for the four cases illus- 
teed because the perturbed network is derived trated in Figure|2l with slightly better results be- 
from the original network by rewiring), we un- ing achieved for the measurement combinations 
derstand there has been a correct identification, in (a) and (d). The standard deviations values 
Among the several combinations of mea- ^^^d to follow the mean, with higher variations 
surements, considering varying hierarchical lev- ^^^"S observed along the plateaux, 
els, the best results were obtained for pairwise 

combinations of A^, E^, and Ca, particulalry CONCLUDING REMARKS 

the four situations shown in Figure El The dia- 
grams in this figure depict the average ± stan- This article has addressed several issues re- 

dard deviation of the percentage of correctly garding the generalization of complex networks 

identified nodes by using the identified pairs of measurements. First, we have shown for the first 

measurements up to the hierarchical levels iden- time that complex networks and their proper- 

tified in the x-axis (varying from 1 to 19). A ties can be formalized in terms of mathemati- 

number of interesting features can be identified cal morphology, allowing the definition of a se- 

from such results. First, it is interesting to no- ries of measurements such as the generalized 

tice that the average of correct identifications versions of the node degree and clustering co- 



0.S- 
0.8- 




Figure 2: Four hierarchical measurements combinations for the feature vector: (a) and E^; (b) and 
Hd.; (c) Cd, Ad and Hd and (d) Ad and Hd. 



efficient, as well as the possibility to use other 
features from mathematical morphology so as to 
investigate further the structure of specific sub- 
networks. Second, we have emphasized the im- 
portance of identifying and studying the proper- 
ties of subnetworks of special interest — includ- 
ing the set of hubs, outnodes and 3 -cycles, and 
shown that a particularly comprehensive study 
of such subnetworks can be obtained by taking 
into account a whole series of neighborhoods, as 
allowed by the the novel proposed concepts of 
generalized degrees and clustering coefficient. 

While the new set of measurements extended 
to take into account subgraphs and hierarchies 
can be used to derive new network growth 
schemes and characterize and classify differ- 



ent types of networks, they also present power 
for enhanced discrimination between individual 
nodes. The latter has been illustrated for the 
first time in this article with respect to protein- 
protein interaction networks. More specifically, 
we have shown that the ability to identify cor- 
respondenced between nodes in two versions of 
a network (in the case of our example the origi- 
nal and perturbed networks) tend to increase by 
considering measurements taking into account 
multiple hierarchical levels. This is a conse- 
quence of the fact that the use of more hierar- 
chical levels allows the measurements to reflect 
in a less degenerate way the network connectiv- 
ity around each node. At the same time, we have 
shown that such enhanced discriminative power 



tends, with the incorporation of additional hier- 
archical depths, to reach a plateau and then to 
decrease. 

The possibilities for future works include the 
application of the introduced concepts to com- 
munity finding, characterization of resilience to 
attack, and extensions to measurements aimed 
at characterizing the assortative properties of 
networks. 
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